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IMMUNE-EVADING PROTEINS 

This invention relates to materials and methods for 
5 rendering proteins invisible to the cellular immune system. 

The use of recombinant DNA technology for the production of 
vaccines and transfer vectors for gene therapy has opened 
new possibilities for the prevention and cure of diseases. 

10 Most gene therapy approaches are based on the introduction 
in the somatic cells of the host of gene sequences from the 
same or other species. Amongst the genes of interest are 
those coding for active substances such as enzymes, 
hormones, cytokines, immunogenic proteins from viruses, 

15 bacteria or parasites, or intracellular control proteins 
that regulate gene expression and cell growth. 

Several methods have been developed to optimise the transfer 
of exogenous genes to somatic cells. Some of the most 

20 efficient methods employ recombinant vectors obtained by 
genetic manipulation of native viruses such as vaccinia 
viruses, adenoviruses or retroviruses. The viruses are 
generally rendered non-pathogenic by modification of 
endogenous genes, and the gene of interest is exchanged for 

25 ;3art of the virus genetic material. Alternatively, isolated 
DNA fragments coding for the gene of interest may be 
introduced into somatic cells via inert carriers such as 
liposome vectors, or via so-called "gene gun" methods 
whereby DNA/RNA coated carriers such as gold particles are 

3 0 shot at high pressure into dermal cells. 

One potential problem in gene therapy is that the host 
immune system may recognise as foreign therapeutic material 
that has been introduced into the host. For example, where 
3 5 a transfer vector carrying a gene encoding a therapeutic 
protein is used to introduce a gene into a mammal, a host 
immune response directed to structural or regulatory 
proteins of the transfer vector may be induced. Another 
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potential problem is induction of an immune response against 
the therapeutic protein itself. 

Foreign proteins are processed in host cells and presented 
5 at the cell surface in association with MHC class I or class 
II molecules. Cells expressing foreign proteins in this way 
are recognized by the immune system and eliminated. Strong 
memory responses are activated if the therapeutic protein is 
re-administered , resulting in significant impairment of 

10 therapeutic efficacy. This problem can be obviated by the 
administration of immunosuppressive drugs but this may have 
serious side effects such as increased risk of opportunistic 
infections and virus-induced lymphomas. There is therefore 
substantial interest in developing methods for selectively 

15 inhibiting immune responses that will favour the long term 
persistence of cells expressing potentially immunogenic 
proteins and eliminate the need for pharmacologically 
induced immunosuppression . 

20 Thus it is an object of the present invention to provide 
materials and methods for rendering proteins of interest 
invisible to the cellular immune response of the target host 
animal. 

25 The invention is based on the discovery that a glycine-rich 
repeat sequence, when inserted into a protein which is 
normally antigenic, confers upon the recombinant protein the 
ability to evade the immune system. Thus, the invention 
allows for selective inhibition of immune responses in that 

30 cells bearing the recombinant protein are not eliminated by 
the immune system. 

Therefore, according to a first aspect of the invention 
there is provided a method of rendering an antigenic protein 
35 invisible to the immune system by the insertion in said 
protein of a glycine-rich repeat sequence. 

The term "antigenic protein" refers to a protein which is 
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recognized by immunocompetent cells (i.e., cells of the 
immune system) . Recognition of immunocompetent cells is 
indicated when the protein or epitope triggers activation of 
such cells, as measured in terms of proliferation and/or 
5 induction of effector functions, e.g., as measured by 
production of lymphokines, cytokines, and/or killing of 
cells expressing the protein or epitope. Therefore, a 
protein is non-antigenic when is not recognized by 
immunocompetent cells, as explained above. 

10 

By "invisible to the immune system" or "ability to evade 
the immune system" it is meant that a protein has the 
ability to avoid or inhibit a T-cell immune response, such 
as a MHC class I or class II restricted CTL response, and 
15 thus can be determined, at least initially, via in vitro CTL 
assays. 

The term "glycine-rich repeat sequence" refers to any amino 
acid sequence containing "n" contiguous repeats of the 
20 generic "repeating unit": 

[Gly § X Gly b Y Gly c Z] 

wherein a is 1, 2, or 3; b is 1, 2, or 3; c is 1, 2, or 3; 

25 each of x, Y and Z is, independently, a hydrophobic or polar 
amino acid without a ring structure and having a side-chain 
of less than 3 atoms; and wherein "n" is an integer between 
1 and 66 inclusive. For the sake of clarity, each of a, b, 
c, X, Y, & Z need not be identical between each of the n 

30 repeating units. 

Preferably, each of X, Y, and Z is, independently, Ala, Ser, 
Val # He, Leu or Thr. More preferably, each of X, Y and Z 
is, independently, Ala or Ser. Even more preferably, each of 
35 X, Y and Z is Ala. It is also preferred that each of X and 
Y is not Met or Cys. 

Preferably, in 7 contiguous repeating units of the "n" 
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repeating units of the glycine-rich repeat sequence, a«2, 
b-1, & c-l. 

Preferably, in 9 contiguous repeating units of the glycine- 
5 rich repeat sequence, a=2, b=l, & c=2. 

More preferably, in 7 contiguous repeating units of the 
glycine-rich repeat sequence, a=2, b=l, & c=3 . 

10 More preferably, in 7 contiguous repeating units of the 
glycine-rich repeat sequence, a=2 , b=l, & c=2, and in a 
further 9 contiguous repeating units of the same glycine- 
rich repeat sequence, a=2, b=l f & c=3. 

15 Preferably, there are 28 repeating units in total. 

Most preferably, the glycine-rich repeat sequence is that 
shown in Figure 1, which is the EBNA1 glycine-rich sequence. 

2 0 It will also be understood that the glycine-rich repeat 

sequence may consist entirely or substantially of glycine 
amino acids. 

It will be understood that the present invention covers the 
25 situation where one or more glycine-rich repeat sequences 
are inserted into said protein. 

Furthermore, it is preferred that the protein retains its 
biological function after insertion of the glycine-rich 

3 0 repeat sequence. 

According to a second aspect of the invention, there is 
provided a protein formed by the method of the first aspect 
of the invention, containing a glycine-rich repeat sequence 
35 as described above, wherein said protein is invisible to the 
immune system. For convenience, this protein may be said to 
comprise an antigenic "core protein" and a "glycine-rich 
repeat sequence". 
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The glycine-rich repeat sequence may be fused to either the 
carboxy or amino terminal end of the core protein. 
Alternatively, it may be inserted anywhere within the core 
5 protein. 

Preferably, the glycine-rich repeat sequence is inserted 
within the core protein at a distance wherein either 
terminal residue of the glycine-rich repeat sequence is at 
10 a distance of between 1-3 00 residues from an epitope of the 
core protein. Preferably, this distance is between 1-200 
residues, more preferably 10-100 residues, and most 
preferably between 1-50 residues. 

15 As used herein, "epitope" refers to an antigenic stretch of 
about 8-2 0 amino acids within a protein that is 
immunologically recognized as foreign. 

The core protein may contain one or more epitopes. Where the 
20 core protein contains multiple epitopes, it is believed that 
all epitopes of the antigenic protein will be rendered non- 
antigenic according to the invention. 

Preferably, the protein of the second aspect of the 
25 invention is a therapeutic protein. Preferably, the 
therapeutic protein is selected from the group consisting of 
enzymes, cytokines, lympokines, cell adhesion molecules, 
costimulatory molecules, protein products of drug resistance 
genes or tumour suppressor genes. 

30 

In a further preferred embodiment of the present invention 
the protein of the second aspect of the invention is a 
marker. Preferred markers include neoR, the marker encoded 
by the multidrug resistance gene or the thymidine kinase 
35 gene, p galactosidase, dehydrof olate reductase (DHFR) and 
chloroamphenicolacety 1 transferase . 

In a further preferred embodiment of the present invention 
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the protein of the second aspect of the invention is an 
immunogenic structural or regulatory protein of a viral 
vector- Preferably, the vector is a vaccinia vector, a 
adenovirus vector, a herpesvirus vector, a Semliki Forest 
5 virus vector or any retrovirus vector developed for the 
purpose of gene therapy. 

In a further preferred embodiment of the present invention 
the protein of the second aspect of the invention is a 
10 component of a vaccine to be used for prevention or cure of 
viral, bacterial or parasitic infections, or a cancer 
vaccine. 

In a further prefered embodiment of the invention the 
15 glycine-rich repeat sequence as described above may be 
linked to an antigenic protein rendering the antigenic 
protein invisible to the immune system. The glycine-rich 
repeat sequence may be linked to the antigenic protein using 
any known techniques including chemical techniques. 

20 

According to a third aspect of the invention there is 
provided a nucleic acid encoding a glycine-rich repeat 
sequence as described above. 

25 Preferably, the nucleic acid encodes a polypeptide with cis- 
acting modulatory activity on antigen processing or 
presentation. 

Preferably, the nucleic acid encodes amino acids 89-327 of 
30 Epstein Barr virus (strain B95.8) nuclear antigen 1 (EBNA1) , 
as shown in Figure l. 

According to a fourth aspect of the invention, there is 
provided a nucleic acid encoding a protein according to the 
3 5 second aspect of the present invention. 

According to a fifth aspect of the invention there is 
provided a vector comprising a nucleic acid according to the 
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third or fourth aspects of the present invention. 

Preferably, the vector additionally comprises control 
sequences necessary to effect the expression of the nucleic 
5 acid sequence contained in the vector. The nature of such 
control sequences differs depending upon the host organism; 
in prokaryotes, such control sequences may include a 
promoter, a ribosomal binding site, and a transcription 
termination sequence; in eukaryotes, such control sequences 
10 may include a promoter and a transcription termination 
sequence. Additional control sequences may also be included 
such as leader sequences, enhancers and locus control 
regions (LCRs) . 

15 According to a sixth aspect of the invention there is 
provided a host cell transformed with a vector according to 
the fifth aspect of the present invention. 

The host cell may be a prokaryotic cell such as a bacterium, 
20 and is preferably a eukaryotic cell such a mammalian cell. 

According to a seventh aspect of the invention there is 
provided a method for testing for glycine-rich repeat 
sequences which confer on an antigenic protein containing 
25 such a sequence the ability to evade the immune system. 

This method comprises the steps of incubating a host cell 
that expresses the protein of the second aspect of the 
invention with a cell of the immune system that is specific 

3 0 for an epitope that is common to said protein and a 
reference protein, under conditions which allow for 
immunological recognition of said epitope, wherein said 
epitope is known or determined to be immunogenic in said 
reference protein, and wherein an absence of immunological 

35 recognition is indicative of evasion of the immune system. 

Preferably, said immunological recognition is MHC class I 
restricted lysis of said host cell bearing said reference 
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protein or epitope thereof. Alternatively, said 
immunological recognition comprises the ability of a host 
cell bearing said reference protein or epitope thereof to 
induce production of a lymphokine, such as INF, TNF, or one 
5 of the inter leukins. 

According to an eighth aspect of the invention there is 
provided a method of selectively evading an immune response 
in a mammal, preferably a human, comprising administering to 
10 a mammal a protein according to the second aspect of the 
present invention, whose core protein is immunogenic in the 
absence of the glycine-rich repeat sequence and non- 
immunogenic in the presence of said glycine-rich repeat 
sequence. 

15 

According to a ninth aspect of the invention there is 
provided a method of selectively evading an immune response 
in a mammal, preferably a human, comprising administering to 
a mammal a nucleic acid according to the fourth aspect of 
20 the invention encoding a protein whose core protein is 
immunogenic in the absence of the glycine-rich repeat 
sequence and non-immunogenic in the presence of said 
glycine-rich repeat sequence. 

25 According to a tenth aspect of the invention there is 
provided a process for modifying a nucleic acid segment 
encoding a protein to be expressed in cells of human or 
other species by insertion of a nucleic acid sequence 
encoding a glycine-rich repeat sequence according to the 

30 third aspect of the invention. 

Preferably, said nucleic acid segment encodes a therapeutic 
protein. Preferably, the therapeutic protein is selected 
from the group consisting of enzymes, cytokines, lympokines, 
35 cell adhesion molecules, costimulatory molecules, protein 
products of drug resistance genes or tumour suppressor 
genes. 
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In a further preferred embodiment of the present invention 
the nucleic acid segment encodes a marker gene. Preferred 
marker genes include neoR, multidrug resistance gene, 
thymidine kinase gene, 0 galactosidase, dehydrof olate 
5 reductase (DHFR) and chloroamphenicolacetyl transferase. 
The marker gene can be inserted into cells of a mammal, such 
as a human, together with a therapeutic gene or separately. 

In a further preferred embodiment of the present invention 
10 the nucleic acid segment encodes an immunogenic structural 
of regulatory protein of a viral vector. Preferably, the 
vector is a vaccinia vector, a adenovirus vector, a 
herpesvirus vector or any retrovirus vector developed for 
the purpose of gene therapy. 

15 

In a further preferred embodiment of the present invention 
the nucleic acid segment encodes a component of a vaccine to 
be used for prevention or cure of viral, bacterial or 
parasitic infections, or a cancer vaccine. 

20 

In an eleventh aspect of the invention there is provided a 
pharmaceutical composition comprising the protein of the 
second aspect of the invention or the nucleic acid of the 
fourth aspect of the invention in combination with a 
25 pharmaceutical^ acceptable carrier or diluent. 

In a twelfth aspect of the invention there is provided the 
protein of the second aspect of the invention or the nucleic 
acid of the fourth aspect of the invention for use in 
30 therapy. 

A thirteenth aspect of the invention is the use of the 
protein of the second aspect of the invention or the nucleic 
acid of the fourth aspect of the invention in the 
35 manufacture of a composition for the treatment of a disease 
by gene therapy. 

Numerous diseases may be treated using the composition of 
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the present invention including genetic and viral diseases, 
especially enzyme disorders such as Gaucher «s disease, 
cancers, immune system disorders and any other diseases 
treatable using gene therapy. 

5 

In a further aspect of the present invention there is 
provided a method of treatment comprising administering a 
therapeutically effective amount of the protein of the 
second aspect of the invention or the nucleic acid of the 
10 fourth aspect of the invention to a patient. 

In a further aspect of the present invention there is 
provided a process for providing humans or other species 
with a therapeutic protein or marker gene as defined above, 
15 containing the glycine-rich repetitive sequence described 
above . 

In a further aspect of the present invention there is 
provided a process for providing humans or other species 
20 with the vector according to the fifth aspect of the 
invention. 

In a further aspect of the present invention there is 
provided a process for providing humans or other species 
25 with recombinant vaccines, wherein the recombinant vaccine 
contains the glycine-rich repetitive sequence described 
above. 

The invention is now illustrated in the appended examples, 
30 with reference to the following figures: 

Figure 1 provides the amino acid sequence of the internal 
gly-ala repeat of the EBNA1 protein from the B95.8 EBV 
strain. 

35 

Figure 2 is a schematic outline of the full length EBNA1 
chimera, gly-ala deleted EBNAl chimeras and gly-ala 
containing EBNA4 chimera. The restriction site and position 
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of insertion of the EBNA4 416-424 epitope and EBNA1 internal 
repeat region (IR) are indicated by vertical arrows pointing 
to the amino acid number in the B95.8 sequence. The known 
EBNA1 protein domains are indicated as follows: gly-ala 
5 repeats (black box) ; gly-arg (grey box) ; nuclear 
localization signal (hatched box) ; DNA binding and 
dimerization domains (striped box) • 

Figure 3 shows results of sensitization of HLA All positive 

10 fibroblasts infected with vaccinia recombinants expressing 
the gly-ala deletion mutants and full size EBNA1 chimeras to 
lysis by All-restricted CTLs specific for the EBNA4 416-424 
epitope. Fibroblasts expressing the gly-ala deleted 
chimeras were lysed as efficiently as fibroblasts expressing 

15 the intact EBNA4 whereas fibroblasts expressing the full 
size EBNA1 chimera were not killed. A. lysis of fibroblasts 
expressing the intact EBNA4 protein (Vacc-EBNA4) , or EBNA1 
deletion mutants containing the EBNA4 416-4 24 epitope 
inserted at the Ncol (Vacc-ElAGAN-E4 ) , PflMI (Vacc-ElAGAP- 

20 E4) or Bsu36I sites (Vacc-E1AGAB-E4 ) . Mean and SE of 4 
experiments. B. lysis of fibroblasts infected for different 
lengths of time with the Vacc-ElAGAN-E4 recombinant of with 
the Vacc-E1N-E4 recombinant that contains the intact EBNA1 
chimera. The % specific lysis at 10:1 effector: target ratio 

25 in one representative experiment out of three is shown in 
the figure. 

Figure 4 demonstrates that over-expression of an intact 
EBNA1 protein does not inhibit processing of EBNA4 and 

30 presentation of the 399-408 and 416-424 epitopes. Lysis of 
HLA All positive fibroblasts expressing EBNA4 , EBNA1 or 
EBNA3 or coexpressing EBNA1+EBNA4 and EBNA3+EBNA4 by CTLs 
specific for the EBNA4 399-408 (grey box) and 416-424 (black 
box) epitopes. The % specific lysis at 10:1 effector: target 

35 ratio of one representative experiment out of three is shown 
in the figure. 

Figure 5 shows results of sensitization of HLA All positive 
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cells expressing EBNA4 or the E4IR chimera to lysis by All- 
restricted .CTLs specific for the EBNA4 399-408 and 416-424 
epitopes • HLA All positive fibroblasts (A and C) or the 
QJZsp LCL that carries a Chinese EBV isolate with mutations 
5 that abrogate HLA All restricted recognition of the 
endogenous EBNA4 protein (B and D) , were infected with the 
Vacc-EBNA4 (open triangle, open circle) or Vacc-E4IR (closed 
triangle, closed circle) recombinants at a m.o.i. for the 
indicated times (A and B) or at the indicated m.o.i, for 12 

10 hrs (C and D) before use as targets for All-restricted CTLs 
specific for the EBNA4 399-408 (open circle, closed circle) 
or 416-424 (open triangle, closed triangle) epitopes. The 
% specific lysis recorded at 10:1 effector .-target ratio in 
one representative experiment out of three performed with 

15 each effector: target combination is shown in the figure. 

EXAMPLES 

Example I describes how to make a recombinant glycine- 

20 containing sequence of the invention. Example II describes 
how to insert such sequences into a core protein to form a 
recombinant protein of the invention. Example III describes 
how to test a recombinant protein containing a glycine- 
containing sequence of the invention for effectiveness in 

25 evading the immune response. Example IV describes the EBNA1 
glycine-containing repeat sequence. Example V describes 
experiments in which a foreign epitope is inserted into 
either the EBNA1 protein or an ENBA1 deletion mutant lacking 
the glycine-containing repeat sequence, and demonstrates 

30 that only the non-deleted EBNAl protein confers non- 
immunogenicity on the inserted foreign epitope. Example VI 
describes experiments in which EBNAl was over expressed in 
the presence of another antigenic protein, and demonstrates 
that EBNAl does not confer trans-acting inhibition of 

35 immunogenicity of the antigenic protein. Example VII 
describes experiments in which the EBNAl glycine-containing 
sequence was inserted into a foreign protein, rendering the 
recombinant foreign protein non-immunogenic. 
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EXAMPLE I 

GLYCINE -CONTAINING SEQUEN CES USEFUL ACCORDING TO THE INVENTION 
Recombinant glycine-containing sequences of the invention 
5 having the following properties . Functionally, such 
sequences, when present in-cis in a recombinant protein, are 
able to render the protein non-immunogenic. Immunogenicity, 
or lack thereof, may be assessed by the capacity of the 
inserted sequence to inhibit specific cytotoxic responses 
10 measured in vitro and/or in vivo immunization tests as 
described below. 

Recombinant glycine-containing sequences are defined 
structurally according to the generic formula, and preferred 
embodiments thereof, described herein. In addition to the 
15 generic formula, sequences of the invention are 
characterized in that they contain a repetitive stretch of 
glycines interspersed by single hydrophobic and/or polar 
amino acids. The presence of intercalating amino acids is 
likely to be required to prevent glycine from forming 
20 secondary structures. 

Intercalating amino acids are likely to be small 
hydrophobic and/or polar, e.g., alanine, which has a short 
side chain consisting of a single -CH 3 group; serine, which 
also contains a single carbon atom side chain (-CH 2 OH) ; 
25 valine, leucine, threonine and isoleucine, which are 
hydrophobic amino acids having side chains ranging from 2- 
4 carbon-containing groups. Without being bound to any one 
theory, it is proposed that a short side chain (i.e., 1-4 
groups) inhibits formation of secondary structures in the 
30 target protein. Thus, Cys and Met are not likely to be 
suitable amino acids according to the invention because they 
contain a sulfhydryl group. 

A comparison of the repetitive sequence in the EBNA1 
protein of the B95.8 EBV strain and the Papio EBNAl-like 
35 antigen reveals that the generic formula provided 
hereinabove is repeated 7 times in Papio EBNA1 and 2 8 times 
in B95.8 EBNA1. In Papio, the generic motif is a regular 
repeat of GlyGlySerGlyAlaGlyAla . The EBV EBNAl repeat is 
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not a regular repeat in that the number of Gly residues 
preceding each Ala varies from 1-3. However, some 
combinations appear more frequently, i.e., 
(Gly^laGly^laGly^la) in n=9 of a total of n»28 repeats; 
5 and/or (Gly 2 AlaGly 1 AlaGly 3 Ala) in n=7 of a total of n=28 
repeats . 

It should be understood that "functional derivatives" of 
the glycine-containing sequence present in Fig. 1 (i.e., the 
EBV EBNA1 sequence) refers to any sequence which confers 
10 non-immunogenicity on the protein in which it is inserted, 
and therefore may encompass sequences of varying length and 
composition (i.e., longer or shorter or internally deleted 
sequences may be functional and are testable for function as 
described below) , and varying amino acid composition 
15 (additional amino acids may be included or exchanged for 
parts the original gly-ala sequence) as long as immune 
system evasion is maintained. 

It is preferred that the total number of residues of the 
glycine-containing sequence be divisible by the number of 
20 basic residues of the repeat: (Gly^j) X 1 (Gly^) X 2 (Gly^) Z , 
that is by 5, 6, 7, 8, 9, 10, or 11, depending upon the 
number of glycine residues present in each repeat. The 
number of repeats (n) in a glycine-containing sequence may 
be from 1-28 repeats and the total number of residues of the 
25 glycine-containing amino acid sequence is between 6 and 314. 
More preferably, n is 2, wherein the total number of 
residues of said sequence is 12-24, inclusive, or n is 7, 
wherein the total number of residues is 42-84, inclusive. 
Preferably, the total number of residues of the sequence is 
3 0 between about 4 00-60 0, where n=41 repeats with 8 residues 
remaining, and where n=66 repeats and 4 residues remaining. 

Where a glycine-containing sequence is needed according to 
the invention, for example, for insertion into an antigenic 
protein to render it non-antigenic , the B95.8 EBNA1 sequence 
35 presented in Fig. 1 may be provided as described herein. 

Alternatively, where another glycine-containing sequence 
is selected according to the generic formula and guidance 
for such sequence selection provided herein, it may be made 
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according to any cloning strategy known to one of skill in 
the art. Two alternative strategies are described below; 
i.e., by PCR amplification of a prototype sequence or by 
oligomerization of a selected minimal motif. The advantage 
5 of this latter strategy is that it can easily be modified to 
give repetitive sequences where the intervening X or Y amino 
acid or the Glycine backbone can be mutated in different 
combinations. 

10 1. PCR Amplification 

Where a DNA fragment encoding a glycine-containing sequence 
is available, the fragment may be used for PCR 
amplification, as follows. 
A 1122bp long Xmal fragment of the prototype B95.8 BamHIK 

15 region containing the Glycine repeat sequence (coordinates 
108117-109239 as described in Baer et al., Nature 310:207- 
211, 1984) may be used as template for PCR amplification. 
PCR primers are chosen immediately upstream and downstream 
of the repeats. Artificial BamHI and EcoRI sites are 

20 included in the sequences of the 5' and 3' oligonucleotides 
respectively. Amplification under low stringency conditions 
and the repetitive nature of the template allows for 
multiple priming and generation of PCR products of variable 
size. For example, the primer pair 5'- 

25 AAGGATCCAAGTTGCATTGGATGCAA-3 1 and 

5 1 -TGAATTCTCGACCCCGGCCTCCACTG-3 ' may be used in a PCR 
reaction as follows: 50nM of each primer ad 10-20 ng of each 
template are mixed in a 50 m! reaction buffer containing ImM 
Tris-HCl pH7.5, 5mM KCl, 0.15-0.5 nM Mgci 2 , 0.001% gelatin, 

3 0 200 /iM dNTP and 2 units Taq DNA polymerase. Amplification 
conditions include 30-35 cycles of denaturation at 95° C for 
1 min, annealing at 45-60° C for l min, and elongation at 
72° C for 1 min. 
The PCR products are cloned in the BamHI and EcoRI sites 

3 5 of the pGEX-T2 vector downstream of the glutathione S- 
transferase gene (GST) . The fusion protein GST-GlyAla is 
expressed in bacteria using the tac promoter. Expression of 
fusion proteins containing the repeats inserted in-frame 
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downstream of the GST gene is then screened by Western 
blotting of lysates from single transformed colonies using 
affinity purified human antibodies specific for the EBNA1 
GlyAla repeat (Dillner et al. r 1984, Proc. Nat, Aca. Sci. 
5 81; 4652) . Screening is performed after induction of 
individual bacterial clones grown in microwell plates with 
0.3 mM IPTG in 150 fil LB medium for 4 hr. 80 jxl of 
bacterial cell suspension is mixed with an equal volume of 
SDS-PAGE loading buffer and dotted onto nitrocellulose 

10 filters using a dot-blot apparatus. The filters are 
processed according to standard Western blot procedures and 
developed by ECL (Amersham) . 

Colonies expressing the GST-GlyAla polypeptide are further 
characterized to determine the size and coding capacity of 

15 the insert by restriction endonuclease analysis and 
sequencing. Plasroids encoding fusion proteins that contain 
repetitive inserts of 50 # 100, 150, 200 f 250, 300 amino 
acids (pGEX- (ElGlyAla) n) are selected for isolation of 
GlyAla encoding cassettes as described below. 

20 2. Oligomerization of a selected minimal motif, 

A set of complementary oligonucleotides encoding a core 
glycine-containing motif are synthesized with 5 1 and 3' 
overhangs to allow for oligomerization. This strategy 
offers the advantage of producing known sequences that can 

25 easily be modified to include alternative amino acids. 

Examples of complementary oligonucleotides encoding the 
core motif GlyAlaGlyAlaGlyGlyAlaGly and modifications 
thereof are provided in Table l. The expected coding 
capacity upon insertion in positive or negative orientation 

30 relative to the direction of transcription is presented in 
Table 2. After annealing to form a duplex, the 
oligonucleotides will contain 5' and 3* overhangs 
corresponding to the initial amino acid codon and the first 
base of the adjacent codon to allow the formation of head- 

35 to-tail multimers upon ligation. Annealing is performed in 
a 50^1 reaction containing 100 tiH of each primer, 0.1 M 
MgCl 2 , 10 mM Tris-HCl pH7 . 4 . The reaction mix is heated at 
72°C for 5 min and allowed to proceed at 65°C for additional 
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40 min. Ligation is performed by adjusting the annealing 
mix to 50 mM Tris pH 7.4 10 mM MgCl 2 , 10 mM DTT, 1 mM 
spermidine, 1 mM ATP , 100 ng/ml BSA and by adding 10 u of T4 
DNA ligase. The reactions are run for 1, 3, 6, 9 and 12 hrs 
5 at 15°C. Filling-in of the 3 1 recessed ends is performed 
with 0.1 u of the Klenow fragment of DNA polymerase in 50 mM 
Tris-HCl pH7.5, 7mM MgCl 2 , 1 mM DTT and 20 mM dNTPs for 20 
min at room temperature. Linear multimeric molecules are 
blunt-end ligated into the Smal site of the pGEX-T2 vector 

10 as shown in Table 3. Clones expressing the GST-repeat 
fusion protein are selected by reactivity with GlyAla 
specific antibodies, as described herein. For fusion 
proteins containing repeats for which specific antibodies 
are not available, selection is performed on the basis of 

15 size after purification from the bacteria lysates on GST- 
binding glutathione-coated sepharose beads. Expressing 
clones are selected for further characterization of the 
inserts. 

Plasmids are digested with BamHI and EcoRI that cut 
20 immediately upstream and downstream of the Smal site in the 
pGEX-T2 vector. The inserts are size fractionated in 3% 
agarose gels and visualized by UV irradiation after ethidium 
bromide staining, Alternatively, plasmids and insert 
fragments obtained after BamHI/ EcoRI digestion are end 
25 labelled with 1 unit Klenow fragment of DNA polymerase per 
Mg plasmid in 25 /il reactions containing 50 mM Tris-HCl pH 
7.5, 10 mM MgS04, 1 mM DTT, 50/ig/ml BSA Pentax fraction V, 
2 nmoles dNTPs and 2pmoles a- 32 P-dCTP. Size fractionation 
is performed in 8% acrylamide gels. Plasmids containing 
30 inserts of sizes corresponding to the coding sequences of 2, 
4, 8, 12 and 16 core repeats (48, 96, 192, 288, 384 bp, 
respectively) are further characterized by sequencing to 
confirm their coding potential and orientation of insertion. 

35 A glycine-containing repeat sequence obtained according to 
the procedures described above is subcloned into a 
polylinker of an appropriate vector to generate subcloning 
cassettes containing unique upstream and downstream 
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restriction sites. For example, subcloning into the BamHI 

and EcoRI sites of the pBluescript-SK vector (Stratagene) 

will generate subcloning cassettes containing unique 

upstream sites: Sad, BstXI f NotI, Xbal, and Spel, and 

5 unique downstream sites Hindlll, Clal, Hindi, AccI, Sail, 

Xhol, Apal, Drall and KpnI. 

EXAMPLE II 

RECOMBINANT PROTEINS OF THE INVENTION 
CONTAINING GLYCINE-CONTAINING SEQUENCES 
10 AND CHIME RIC GENES ENCODING SUCH PROTEINS 

Recombinant proteins of the invention which contain a 
glycine-containing sequences are made as follows. 
15 1. Construction of chimeric genes encoding proteins 
containing Glycine repeat sequence • 

A DNA cassette encoding a Gly-containing sequence is 
inserted into a selected site within a gene encoding a 
protein which is believed or known to be immunogenic in its 

20 natural form. 

DNA encoding a candidate glycine-containing sequence is 
inserted into a gene encoding a protein of interest at a 
selected distance from a known epitope using conventional 
cloning procedures. It is preferred that the gly-containing 

25 sequence be inserted downstream (i.e., carboxy terminal to) 
of a selected epitope (i.e. an epitope to be rendered 
nonimmunogenic) , e.g., at least 5 amino acids carboxy 
terminal to the epitope, more preferably at least 10-20 
amino acids, at least 50 amino acids, 100 amino acids, or 

30 even at least 200 amino acids carboxy terminal to the 
epitope. Of course, the gly-containing sequence may be 
positioned at the carboxy terminus of the protein via an 
in-frame fusion. 
By way of example, the cloning strategy for insertion of 

35 DNA encoding the glycine sequence (generically referred to 
as ("GlyX)n") into the influenza matrix protein is described 
in detail below. 

Three unique restriction sites, Rsal, BamHI and StuI, are 
used for insertion of a (GlyX) n-encoding DNA cassette into 

40 the cDNA encoding the influenza matrix protein I from strain 
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A/Puerto Rico/8/34 (matrix) (Winter et al., Nucl. Acids Res. 
8:1965, 1980). In addition, the cassette encoding the Gly- 
containing sequence is inserted at the 3' end of the gene, 
immediately downstream of the matrix coding sequence. An 
5 immunodominant HLA A2 restricted CTL-specific epitope, 
referred to as GIL, has been identified in amino acid 
residues 58-66 (GILGFVFTL) of this 252 amino acid long 
protein (Bednerck et al., J. Immunol. 147:4047, 1991). The 
Rsal site is located at bp 28 relative to the initiating ATG 
10 codon, 47 amino acid residues upstream of the GIL epitope. 
The BamHI site is located at bp 264, 22 amino acids 
downstream of the GIL epitope and the StuI site is located 
at bp 713, 172 residues downstream of the epitope near the 
COOH terminus. The matrix cDNA is excised by EcoRl 
15 digestion of pGS62 (Smith et al., Virol, 160:336, 1987) and 
recloned into the EcoRl site of pGEM-9zf(-) (Promega) to 
create pGEM-matrix. The pGEM-matrix contains the matrix 
protein driven by the SP6 and T7 promoters for in vitro 
transcription/translation . 
20 Construction of a chimeric gene containing a (GlyX)n- 
encoding cassette inserted into each of four different 
positions within the selected protein is described below. 
In each cloning example, care is taken to obtain cassette 
insertion in-frame within the coding sequence of the given 
25 protein, It will be understood by those of skill in the art 
that the examples presented below are representative cloning 
strategies for in-frame cassette insertion; modifications 
will be made as needed for a candidate (GlyX)n DNA cassette 
and a gene encoding a selected protein. 
30 A DNA cassette encoding a Gly-containing sequence is 
subcloned into the Rsal site of the gene encoding the 
influenza matrix protein, as presented in Table 4 and as 
follows. pGEM-matrix plasmid is partially digested with 
Rsal. pGEM-9zf(-) contains a unique Rsal site located 
35 within the amp* gene. Recombinants containing the cassette 
inserted into this site will be selected against in this 
cloning procedure due to loss of amp resistance. A DNA 
fragment containing the (GlyX)n cassette is isolated via 
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BamHI digestion, the BamHI 3 1 recessed ends of the pGEX- 
T2(GlyX)n plasmid are then partially filled-in with A and G 
nucleotides, the DNA then digested with EcoRI and the 5' 
protruding ends are removed by Mung bean exonuclease 
5 treatment. The modified fragment is blunt-end ligated into 
the Rsal site of the matrix protein coding region in the 
pGEM-matrix plasmid. Insertion in positive orientation will 
yield a chimeric gene encoding amino acids 1-9 of influenza 
matrix protein followed by the sequence PhePro (GlyX) nAsp and 

10 amino acids 11-252 of influenza matrix protein. 

Subcloning of the (GlyX)n DNA cassette into the BamHI site 
of the gene encoding the matrix protein is presented in 
Table 5, and includes the following steps. The pGEM-matrix 
plasmid is digested with BamHI followed by Klenow of the 3 1 

15 recessed ends. The (GlyX) n-encoding fragment is isolated 
from pGEX-2T(GlyX)n by BamHI /EcoRI digestion followed by 
partial filling of 3* recessed ends using A and G 
nucleotides , and Mung bean exonuclease treatment of the 
remaining 5 1 protruding ends. The (GlyX)n DNA fragment is 

20 blunt-end ligated into the filled-in BamHI ends of the pGEM- 
matrix plasmid. Insertion in positive orientation produces 
a chimeric gene encoding amino acids 1-88 of influenza 
matrix protein followed by the sequence LeuPro(GlyX) nGlu and 
amino acids 89-252 of influenza matrix protein. 

25 Subcloning of the (GlyX)n DNA cassette into the StuI site 
of the gene encoding matrix protein is presented in Table 6 
and includes the following steps. The pGEM-matrix plasmid 
is digested with StuI. The Gly-encoding cassette is 
isolated by BamHI digestion of the pGEX-2T-GlyX plasmid 

30 followed by partial filling in using A and G nucleotides. 
The plasmid is then digested with EcoRI and the 5 1 
protruding ends are removed by Mung bean exonuclease 
treatment. The (GlyX)n fragment is blunt-end ligated into 
the StuI site of the pGEM-matrix plasmid. Insertion in 

35 positive orientation produces a chimeric gene encoding amino 
acids 1-238 of influenza matrix protein followed by the 
sequence ValPro (GlyX) n and amino acids 239-252 of the matrix 
protein. 
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A fusion protein with a carboxyl terminal (GlyX)n sequence 
may also be made according to the invention. Subcloning of 
a DNA cassette encoding a (GlyX)n sequence into the 3 f end 
of the influenza matrix protein gene is presented in Table 
5 7 and includes the following steps. DNA encoding matrix 
protein is prepared by PCR amplification using the following 
respective 5 ' and 3' primers: 
5 1 ATAAAGCTTATGAGTCTTCTAACCGAGGTC3 ■ and 
5 1 CGAGGATCCACTTGAACCGTTGCATCTGC3 1 . The 5' primer contains 
10 an artificial Hindlll site upstream of the first ATG and the 
3 • primer contains a BamHI site downstream of the codon for 
the last residue of the matrix protein. 

The (GlyX)n DNA cassette is isolated by digestion of pGEX 
(GlyX)n with EcoRI, removal of 5' protruding ends with Mung 
15 bean nuclease, and digestion with BamHI. The plasmid into 
which the matrix protein coding sequence and the (GlyX)n 
cassette are inserted is pGEM-9zf digested with Xbal, 
treated with Mung bean nuclease, and further digested with 
Hindlll. 

20 The three thus-prepared DNAs , i.e., matrix-encoding DNA, 
(GlyX) n-encoding DNA, and the pGEM vector, are ligated in a 
single reaction involving site-directed ligation of matching 
Hindlll ends, BamHI ends, or blunt-ends. The resulting 
plasmid, pGEM-matrix- (GlyX) n-COOH encodes a 242 amino acid 

25 matrix protein fused in-frame via its carboxyl terminal 
residue to the {GlyX)n sequence. Where the amino acid 
sequences are joined, two new residues (Trp and He) are 
created by the BamHI ligation. A UAG stop codon is present 
in-frame downstream of the fusion protein. 

3 0 The correct orientation and reading frame alignment of the 
(GlyX) n-encoding insert is determined by DNA sequencing. 
Expression of the chimeric protein will be determined by 
western blots of in vitro translated material using 
antibodies specific for the influenza matrix protein (Smith 

35 et al., Virol., 160:336, 1987) and affinity purified anti- 
(GlyX)n antibodies, as described above. When desired, 
optimization of inhibition of immunogenicity of the protein 
of interest may be achieved by varying the size of the 
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glycine-containing sequence or by varying the position of 
insertion of the gly-containing sequence relative to the 
immunogenic epitope. 

2. Expression of chimeric proteins containing the Gly repeat 
5 sequence in eukaryotic cells. 

Delivery systems are known in the art for introduction of 
foreign genes into eukaryotic cells. A chimeric gene 
according to the invention may be inserted into an 
appropriate delivery vehicle for delivery to a target cell. 
10 The delivery vehicle may be a viral vector or a non-viral 
vehicle. Where the chimeric gene is first inserted into an 
appropriate nucleic acid vector, that vector may be a 
plasmid, a virus or a linear DNA fragment, as desired. The 
vector may be naked, complexed with proteins or packaged in 
15 a delivery system such as a liposome, virosome, or a 
receptor mediated complex. 

It is contemplated in the invention to express in specific 
cells a chimeric gene encoding a recombinant protein joined 
in-frame to a gly-containing sequence. The chimeric gene 
20 may be expressed in vivo or ex vivo as a result of 
introduction of genetic material containing the chimeric 
gene into cells which include but are not limited to stem 
cells, macrophages, T-cells, dendritic cells, cells of 
hematopoietic lineage, somatic cells and tumour cells. 
25 Techniques are known in the art for delivery of recombinant 
DNA to these cells. Such cells can be transfected ex vivo , 
i.e., after removal from the body, and then re-introduced 
into the body, or they can be targeted in vivo . 
Transfer of a chimeric gene according to the invention can 
3 0 be accomplished through many means, including but not 
limited to DNA transfection using calcium phosphate 
coprecipitation, fusion of the target cell with liposomes 
containing the gene, erythrocyte ghosts or spheroplasts 
carrying the gene, plasmid and viral vector-mediated 
35 transfer, and DNA protein complex-mediated gene transfer 
such as receptor-mediated gene transfer. 
1. Targeting of Cells Ex Vivo 
For introduction of DNA into a cell ex vivo , a number of 
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protocols for the transfection of various types of cells are 
known in the art. 

Some transfection techniques involve the isolation of stem 
cells from total cell populations, as described in, for 
5 example, European patent applications 0 455 482 and 0 451 
611. T-cell transfection is also described in Kasid et al. , 
Proc. Nat. Aca. Sci., 1990, 87(1) :473. Transfection of 
macrophages is described in Freas et al., Human Gene 
Therapy, 1993, 4(3): 283, and in Krall et al., Blood, 1994, 
10 83(9)L2737. Stem cell transfection is also described in Yu 
et al., Proc. Nat. Aca. Sci., 1995, 92(3):699, Lu et al., 
Human Gene Therapy, 1994, 5:203, Walsh et al. Proc. Soc. 
Exp. Biol. Med., 1993, 204:289; Weinthal et al. Bone Marrow 
Transplant. ,1991, 8:403, Hamada et al. , Jour. Immunol. 
15 Methods, 1991, 141:177. 

Polylysine tagged with asialoglycoprotein may be used to 
complex and condense DNA and target the complex to 
hepatocytes (Wu and Wu, (1987) J. Biol. Chero. 262, 4429; US 
Patent 5,166,320). DNA transfer is believed to occur via 
20 the asialoglycoprotein tag specifically directing the 
complex to only those cells expressing the 
asialoglycoprotein receptor. Monoclonal antibodies also may 
be used to target DNA to particular cell types, as described 
in Machy et al., (1988) Proc. Natl. Acad. Sci. 85, 8027- 
25 8031; Trubetskoy et al., (1992) Bioconjugate Chem. 3, 323- 
7 and WO 91/17773, WO 92/19287. Lactosylated polylysine 
(Midoux et al (1993) Nucleic Acids Res. 21, 871-878) and 
galactosylated histones (Chen et al (1994) Human Gene 
Therapy 5, 429-435) have been shown to target plasraid DNA 
30 to cells bearing lectin receptors, and insulin conjugated to 
polylysine (Rosenkrantz et al (1992) Exp. Cell Res. 199, 
323-329) to cells bearing insulin receptors. 
2. Targeting of Cells In vivo 
In vivo cell targeting may be accomplished according to the 
3 5 invention using receptor-mediated gene transfer. 

Receptor-mediated gene transfer is dependent upon the 
presence of suitable ligands on the surfaces of cells which 
will allow specific targeting to the desired cell type 
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followed by internalization of the complex and expression of 
the DNA . One form of receptor-mediated gene transfer is 
wherein a DNA vector is conjugated to antibodies which 
target with a high degree of specificity for cell-surface 
5 antigens (Wong and Huang, 1987, Proc. Nat. Aca. Sci. 
84:7851; Roux et al., 1989, Proc. Nat. Aca. Sci. 86:: 9079; 
Trubetskoy et al., 1992, Bioconjugate Chem. 3:323; and 
Hirsch et al., 1993, Transplant Proceedings 25:138). 
Nucleic acid may be attached to antibody molecules using 

10 polylysine (Wagner et al., 1990, Proc. Nat. Aca. Sci. 
87:3410; Wagner et al., 1991, Proc. Nat. Aca. Sci. 89:7934) 
or via liposomes, as described below. 

Increased expression of DNA derived from ligand-DNA 
complexes taken up by cells via an endosomal route has been 

15 achieved through the inclusion of endosomal disruption 
agents, such as influenza virus hemagglutinin fusogenic 
peptides, either in the targeting complex or in the medium 
surrounding the target cell. 
Targeted gene delivery is also achieved according to the 

2 0 invention using a DNA-protein complex. Such DNA-protein 

complexes include DNA complexed with a ligand that interacts 
with a target cell surface receptor. Cell surface receptors 
are thus utilized as naturally existing entry mechanisms for 
the specific delivery of genes to selected mammalian cells. 
25 It is known that most, if not all, mammalian cells possess 
cell surface binding sites or receptors that recognize, bind 
and internalize specific biological molecules, i.e., 
ligands. These molecules, once recognized and bound by the 
receptors, can be internalized within the target cells 

3 0 within membrane-limited vesicles via receptor-mediated 

endocytosis. Examples of such ligands include but are not 
limited to proteins having functional groups that are 
exposed sufficiently to be recognized by the cell receptors. 
The particular proteins used will vary with the target cell. 

35 

Typically, glycoproteins having exposed terminal 
carbohydrate groups are used although other ligands such as 
antibodies or polypeptide hormones, also may be employed. 
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Generally, a ligand is chemically conjugated by covalent, 
ionic or hydrogen bonding to the nucleic acid. A ligand for 
a cell surface receptor may be conjugated to a polycation 
such as polylysine with ethylidene diamino carbodiimide as 
5 described in U.S. Patent No. 5,166,320. DNA may be attached 
to an appropriate ligand in such a way that the combination 
thereof or complex remains soluble, is recognized by the 
receptor and is internalized by the cell. The DNA is 
carried along with the ligand into the cell, and is then 

10 expressed in the cell. The protein conjugate is complexed 
to DNA of a transfection vector by mixing equal mass 
quantities of protein conjugate and DNA in 0.25 molar sodium 
chloride. The DNA/protein complex is taken up by cells and 
the gene is expressed. 

15 Liposomes have been used for non-viral delivery of many 
substances, including nucleic acids, viral particles, and 
drugs. A number of reviews have described studies of 
liposome production methodology and properties, their use as 
carriers for therapeutic agents and their interaction with 

20 a variety of cell types. See, for example, "Liposomes as 
Drug Carriers," Wiley and Sons, NY (1988), and "Liposomes 
from Biophysics to Therapeutics," Marcel Dekker, NY (1987). 
Several methods have been used for liposomal delivery of DNA 
into cells, including poly-L-lysine conjugated lipids (Zhou 

25 »2t al., Biochem. Biophys. Acta. 1065 : 8-14 , 1991), pH 
sensitive immunoliposomes (Gregoriadis , G. , Liposome 
Technology, Vol I, II, III, CRC, 1993), and cationic 
liposomes (Feigner et al., Proc. Natl. Acad. Sci., USA, 
41:7413-7417, 1987). Positively charged liposomes have been 

30 used for transfer of heterologous genes into eukaryotic 
cells (Feigner et al., 1987, Proc. Nat. Aca. Sci. 84:7413; 
Rose et al., 1991, BioTechniques 10:520). Cationic 
liposomes spontaneously complex with plasmid DNA or RNA in 
solution and facilitate fusion of the complex with cells in 

35 culture, resulting in delivery of nucleic acid to the cell. 
Philip et al. 1994, Mol. and Cell. Biol. 14:2411, report the 
use of cationic liposomes to facilitate adeno-associated 
virus (AAV) plasmid transfection of primary T lymphocytes 
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and cultured tumour cells. 

Delivery of an agent using liposomes allows for noninvasive 
treatment of diseases. Targeting of an organ or tissue type 
may be made more efficient using immunoliposomes, i.e., 
5 liposomes which are conjugated to an antibody specific for 
an organ-specific or tissue-specific antigen. Thus, one 
approach to targeted DNA delivery is the use of loaded 
liposomes that have been made target-specific by 
incorporation of specific antibodies on the liposome 

10 surface. 

3. Viral Vector Mediated Gene Delivery 

Recombinant viral vectors as well as other DNA transfer 
schemes can be used in practice of the present invention. 
A recombinant viral vector of the invention will include DNA 

15 of at least a portion of a viral genome which portion is 
capable of infecting the target cells and the transcription 
unit and control DNA sequence. By "infection" is generally 
meant the process by which a virus transfers genetic 
material to its host or target cell. Preferably, the virus 

20 used in the construction of a vector of the invention is 
also rendered replication-defective to remove the effects of 
viral replication on the target cells. In such cases, the 
replication-defective viral genome can be packaged by a 
helper virus in accordance with conventional techniques. 

25 Generally, any virus meeting the above criteria of 
infectiousness and capabilities of functional gene transfer 
can be employed in the practice of the invention. 

Suitable viruses for practice of the invention include but 
are not limited to, for example, adenoviruses, adeno- 

3 0 associated virus, retroviruses, and vaccinia viruses, 
representative examples of which follow. Insertion of a 
chimeric gene of the invention into a viral vector involves 
conventional cloning sequences know to those of skill in the 
art. 

35 Viral systems exploit the infectious capacity of viruses 
to introduce foreign DNA to eukaryotic cells with high 
efficiency. The gene of interest is cloned into the viral 
genome under the control of a promoter that is expressible 
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in the selected host cell or organism. Recombinant vaccinia 
viruses, retroviruses and adenoviruses have often been used 
for DNA delivery purposes. A viral delivery vehicle is 
selected based on know advantages and disadvantages of its 
5 use. One of skill in the art will select a delivery vehicle 
weighing such by considerations in view of the intended 
therapeutic use of the non- immunogenic recombinant protein. 

For example, integration of the viral vector into the host 
cell genome occurs after infection with a recombinant 

10 retrovirus- Such integration results in advantageous long- 
term expression of the transduced gene. However, 
retroviruses are limited in their use as vectors in that 
they infect only mitotic cells. Recombinant adenoviral 
vectors have the capacity to infect a wide spectrum of 

15 mitotic and postmitotic target cell but are maintained in 
the cell in episomal form, which may result in loss of the 
transduced gene in parallel with the cellular turnover. 
Recombinant vaccinia viral vectors can confer very high 
levels of expression of the transduced gene, and can infect 

2 0 a broad spectrum of target cells. However, this infection 
is often productive, resulting in rapid clearance of virus 
infected cells. 

Described in detail below is the construction of 
recombinant vaccinia viruses retroviruses and adenoviruses, 

25 each carrying an influenza matrix chimeric gene of the 
invention. 

a) Construction of recombinant Vaccinia virus containing 
sequences encoding Influenza Matrix/Gly-containing sequence 
recombinant protein. 

30 Vaccinia virus recombinants carrying a selected protein 
coding sequence, for example, influenza matrix 1 protein, 
joined in-frame to a Gly-containing sequence are produced 
according to standard procedures (Rj Murray et al. 
J. Exp. Med., 176:157, (1992)). A transfer vector may be 

35 constructed by cloning the influenza matrix cDNA/(GlyX)n 
chimeric gene into the viral vector. For example, the 
chimeric gene may be isolated on an EcoRI fragment from 
pGEM-matrix- (GlyX) n or pGEM-matrix- ( El GlyX)n and cloned 
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into the EcoRI site of pGS62 downstream of the vaccinia p7.5 
early late promoter, which is the promoter of choice for 
expression of target proteins for CTL recognition (BEJ 
Coupar et al., Eur. J. Immunol. , 16:1479, (1986)). Insert- 
5 containing plasmids are sequenced to determine correct 
orientation and reading frame alignment. Recombinant 
viruses are generated by transfection into wild-type-WR- 
strain-vaccinia infected TK-143 cells (S. Chakrabati et al., 
Mol. Cell. Biol., 5:3403 (1985)). Recombinants carrying 
10 insertions into the vaccinia thymidine kinase gene (TK) are 
selected in medium containing 25/xg/ml BuDR. Viral stocks are 
prepared and titrated in CV-1 cells according to standard 
procedures. 

A recombinant vaccinia vector encoding a matrix protein 
15 fused to a gly-containing sequence at the matrix protein 
carboxyl terminus is generated as described above, with two 
differences. First, the chimeric gene is excised from pGEM- 
matrix-(GlyX)n-COOH by Hindlll-Sfil digestion. Second, 
cloning of the chimeric gene fragment into the EcoRI site of 
2 0 the vaccinia vector pGS62 is accomplished via blunt-end 
ligation after filling-in of both vector and insert ends 
with Klenow fragment or T4 DNA polymerase (for the Sfil 
end) . 

2 5 b) Construction of a recombinant retrovirus vector 
containing Influenza matrix/Gly-seguence repeat chimeric 
gene* 

Recombinant retroviral vectors carrying a gene encoding a 
protein of interest or a chimeric gene encoding a 

30 recombinant protein of the invention are made as described 
by Trivedi et al., Int. Jour. Cancer 48:794, 1991. 
Described below, by way of example, is the construction of 
recombinant retroviral vector carrying a gene encoding the 
influenza matrix protein or a gene encoding influenza 

35 matrix/Gly-containing sequence recombinant protein. 

The chimeric gene is excised from the pGEM-matrix (GlyX) n 
or pGEM-matrix(ElGlyX) n or pGEM-matrix (GlyX) nCOOH by Hindlll 
and Sail digestion, and subcloned into the Hindlll-Xhol site 
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of the pCEP4 plasmid (Invitrogen) , resulting in loss of the 
Sail and Xhol sites. The pCEP4 -matrix and pCEP4- 
matrix(GlyX)n plasmids are then cleaved by Sail and Sfil to 
isolate fragments containing the matrix gene or influenza 
5 matrix chimeric gene, driven by the CMV promoter. After 
modification of the Sail 3 ' recessed ends by Klenow and the 
Sfil 3' protruding ends by T4 DNA polymerase, the isolated 
fragments are blunt-end ligated into the SnabI site of the 
pShis4 vector. Helper virus-free viral supernatants are 
10 then produced by transfection into the pA317 packaging cell 
line according to standard procedures (Miller et al., Mol. 
Cell. Biol. 6:2895, 1986). 

c) Construction of a recombinant adenovirus vector 
containing Influenza matrix/Gly sequence chimeric gene. 

15 Recombinant adenoviral vectors carrying a gene encoding a 
protein of interest or a chimeric gene encoding a 
recombinant protein of the invention are made as follows. 
Described below, by way of example, is the construction of 
recombinant adenoviral vector carrying a gene encoding the 
20 influenza matrix protein or a gene encoding influenza 
matrix/ (GlyX) n recombinant protein. 

A Sail fragment containing the CMV promoter, the influenza 
matrix cDNA or influenza matrix chimeric gene and the SV40 
polyA tail is excised from the pCEP4 matrix or the pCEP4- 
25 matrix (Gly-X)n plasmid and subcloned into the Sail site of 
the adenovirus transfer plasmid paElspl, containing 
adenovirus El sequences on both sides of the cloning site 
(Bett et al., Proc. Nat. Aca. Sci. 91:8802, 1994). 
Recombinant adenovirus is generated by homologous 
30 recombination between the pAElspl-raatrix or paElspl-matrix 
(GlyX)n plasmid and pJM17 containing a complete adenovirus- 
5 genome with a partially truncated El after cotransf ection 
into Ad5 El expressing 293 cells (Graham et al., 
J.Gen,Virol, 36:59, 1977). Cotransf ection, virus rescue, 
35 and propagation will be performed according to the standard 
methods (Graham et al., in Gene Transfer and Expression 
Protocols, E.J. Murray ed. , Human a Press, Clifton, NY, 
p, 109-128, 1991). 
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Expression of the chimeric gene may be detected by 
immunological methods, such as detection of the recombinant 
protein via immunofluorescence and immunoblotting using an 
antibody specific for the Gly-containing sequence and an 
5 antibody specific for the protein of interest. 
Alternatively, expression of the chimeric gene may be 
detected by an assay for the biological activity of that 
protein; for example, where the chimeric gene is a marker 
gene such as a NeoR chimera or a TK chimera, expression of 
10 the marker protein may be detected. 

EXAMPLE III 

HOW TO TEST FOR EVASION OF THE IMMUNE SYSTEM 
The procedures described in this example allow one of skill 
in the art to determine whether the presence of glycine- 
15 containing repetitive sequences influences the recognition 
of a foreign protein by the cellular immune system; i.e., 
confers non-immunogenicity on a protein which is immunogenic 
in the absence of the glycine repeat sequence. 
Testing the antigenicity and immunogenicity of proteins 
20 containing a Gly-containing repeat sequence. 

The effect of the presence of a Gly repeat sequence on 
processing/presentation of a protein which in its natural 
form is immunogenic may be tested according to in vitro and 
in vivo assays. By way of example, the immunogenicity of a 
25 recombinant influenza matrix protein 1 containing an 
inserted glycine-containing sequence is determined via in 
vitro and in vivo testing below. 

An in vitro assay is dependent on the availability of 
polyclonal T-cell cultures or clones specific for the 
30 protein of interest and/or for identified target epitopes in 
a given protein, as described by Levitskaya et al. (Nature 
375:685-688, 1995) . 

CTL epitopes useful in testing candidate recombinant 
proteins of the invention are known in the prior art. CTL 
35 epitopes are known for many different antigens, for example, 
for viral, bacterial and parasite antigens. In humans, such 
epitopes include but are not limited to the following: 
i. The HLA A2 restricted epitope located in residues 57- 
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68 of the influenza A matrix protein (Gotch et al., Nature, 
326: 331-332, (1987); Moss et al. , Proc.Natl.Acad.Sci. USA, 
88: 8987-8990, (1991)); 

ii. The HLA B27 and B8 restricted epitopes in HIV pl7 
5 Gag and p24 Gag, respectively (Nixon et al., Nature, 336: 
484-487, (1988); Nixon & McMichael, AIDS 5: 1049-1059, 
(1991)); 

iii. The HLA A2 restricted epitope in the EBV membrane 
protein LMP2 (Lee et al., J.Virol., 67: 7428-7235, (1993)); 
10 and 

iv. The HLA B53 restricted CTL epitope in the liver- 
stage-specific antigen (LSA-1) from P. falciparum (Hill et 
al., Nature, 360: 432-439, (1992)). 

Examples of such epitopes in mouse systems include but are 

15 not limited to: 

i. The H-2Kd restricted epitope in amino acids 249-260 of 
the Plasmodium berghei circumsporozoite protein (Romero et 
al., Nature, 341: 323-325, (1989)). These CTLs can protect 
mice against parasitic infection; and 

20 ii. The H-2kD restricted epitope in animo acids 91-99 

of the listeriolysin of Listeria monocytogenes (Palmer et 
al., Nature, 353: 852-855, (1991)). 

The effect of the Gly-containing sequence on 
processing/presentation of these proteins will be performed 

25 following the strategy described herein for influenza matrix 
protein. That is, a chimeric protein that contains a Gly 
repeat sequence inserted within the protein relative to a 
known CTL epitope is expressed in cells of the appropriate 
MHC class I type using a viral or non-viral delivery system. 

3 0 Several such proteins may be tested, each having a different 
sequence inserted into the same site within the protein or 
having the same sequence inserted into different positions 
relative to the known CTL epitope. Thus, both the size of 
the gly-containing sequence required to achieve inhibition 

35 of antigen processing and the position of insertion of the 
repeat within the coding sequence relative to the CTL 
epitope may be determined. 
To this end, repeats of different length and/or different 
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10 



15 



sequence are inserted into a given protein. The repeats are 
inserted at the NH2 or COOH terminus of the protein of 
interest or within the coding sequence at various distances 
from the known epitope starting from immediately upstream or 
downstream of the epitope. 

A chimeric gene encoding a recombinant protein of the 
invention comprising the influenza matrix protein 1 and a 
glycine-containing sequence is introduced and expressed in 
HLA A2 positive fibroblasts, EBV transformed lymphoblastoid 
cells lines (LCL) , or mitogen-induced blasts according to 
gene delivery methods described herein. Chimeric gene 
expression is confirmed by immunof luoresence using an 
antibody specific for influenza matrix protein l and an 
antibody specific for the glycine-containing sequence. 
Chimeric gene expression also may be confirmed via Western 
blotting, according to standard procedures. A preparation 
comprising host cells expression the transduced chimeric 
gene is used as a target population in a cytotoxicity assay. 
Cytotoxic effector cells may be a polyclonal CTL population 
20 from an HLA A2 positive individual stimulated with 
autologous influenza A virus-infected cells or CTL clones 
specific for the 58-66 epitope, as described (Morrison et 
al., Eur. Jour. Immunol. 22:903, 1992). 
Alternatively, immunological recognition of the transduced 
25 targets may be assessed by the capacity of these cells to 
induce production of lymphokines, such as TNF, INF, 
interleukins, etc., as taught by Asher et al., Jour. 
Immunol. 138:963, 1987. The following TNF assays are 
standard assays for testing for the presence of TNF in T 
30 cell supernatants. The L929 assay is based on the capacity 
of supernatants derived from cocultivation of T lymphocytes 
with cells expressing the relevant antigen to inhibit growth 
of TNF sensitive L929 cells. 

Alternatively, immunological recognition of the transduced 
target cells is assessed via ELISA assays (R&D Systems, 
Mpls, MN) using a commercially available ELISA kit based on 
TNF-specif ic monoclonal antibodies. 
An in vivo assay is based on the capacity of the protein 



35 
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of interest to induce rejection of transfected murine tumour 
cell lines in syngeneic hosts, as described be Trivedi et 
al., Int. J. Cancer 48:794 (1991). For example the 
immunogenicity of the influenza matrix protein and 
5 nonimmunogenicity of the matrix protein fused in-frame to a 
glycine-containing sequence , as described herein, may be 
tested as follows. 

The S6C cell line, established from a spontaneous mammary 
carcinoma line originating in an ACA mouse (H-2f ) (Kuzumaki 
10 et al., Europ. J. Cancer 39:471, 1987) is transfected with 
the pCEP matrix or pCEP matrix(GlyX) n plasraid. High levels 
of expression of the transfected gene is achieved using the 
CMV promoter to drive that gene. ACA mice are immunized 
with cells harboring one of the following: (1) no vector, 
15 (2) pCEP vector with no inserted chimeric gene, (3) pCEP 
plasmid containing the gene encoding the influenza matrix 
protein 1, and (4) pCEP matrix (Gly)n plasmid. See Trivedi 
et al., Int. Jour. Cancer 48:794, 1991. 1 x 10 6 cells, 
irradiated with 10,000 rads, are inoculated subcutaneously 
2 0 once a week for three consecutive weeks. Seven days after 
the last immunization and 24 hours before the viable cell 
challenge, the mice are irradiated with 400 rad to inhibit 
nonspecific immune reactions (Klein et al., Cold Spring 
Harbor Symp. Quant Biol. 27;463, 1962). Groups of 5-10 mice 
25 are challenged with graded numbers of viable cells/mouse, 
e.g., 10 3 -10 6 cells/mouse. Tumour growth is followed by 
weekly caliper measurement in 3 dimensions. The animals are 
dept under observation for a period of 8 weeks after 
challenge. The mean tumour load is calculated by adding the 
30 individual tumour diameters and dividing the sum by the 
total number of mice in the group. The statistics of tumour 
take is calculated according to the Fisher's exact test. 

EXAMPLE IV 

EBNA1 GLYCINE-CONTAINING REPEAT SEQUENCE 
35 In one embodiment of the invention, a repetitive nucleic 
acid sequence encoding a polypeptide corresponding to the 
Epstein-Barr virus nuclear antigen (EBNA) 1 internal gly-ala 
repeat sequence, or functional derivatives thereof, is 
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inserted into a gene encoding a protein of interest. The 
gene is then transferred into and expressed in a host cell 
such that the expressed recombinant, glycine sequence- 
containing polypeptide is non- immunogenic, i.e,, is not 
5 recognized by cytotoxic T lymphocytes. 

According to the invention, a polypeptide corresponding to 
the Epstein Barr Virus nuclear antigen (EBNA)l internal gly- 
ala repeat (amino acid 89-327 of the prototype B95.8 EBV 
strain) is shown in Fig. 1. A recombinant protein is formed 

10 by placing this polypepide within a selected site in an 
antigenic protein to render the protein non-antigenic. 

The Epstein-Barr virus (EBV) encoded nuclear antigen 
(EBNA)l is expressed in latently EBV infected B- lymphocytes 
that persist for life in healthy virus carriers (Tierney, 

15 R.J. at al J. Virol. 68, 7374-7385 1994) , and is the only 
viral protein regularly detected in all EBV associated 
malignancies (reviewed in Masucci, M.G. & Ernberg, I. Trends 
in Microbiology 2, 125-130, 1994, Klein, G. Cell 77, 791- 
793, 1994). Major histocompatibility complex (MHC) class 1 

20 restricted, EBNA1 -specific cytotoxic T lymphocyte (CTL) 
responses have not been demonstrated (reviewed in Masucci, 
M.G. & Ernberg I. Trends in Microbiology 2, 125-130, 1994, 
Rickinson, A. , et al. in A New Look at Tumour Immunology, 
eds. McMichael, A & Franks, L, pp 53-80, CSHL Press, New 

25 York, 1992) suggesting that this viral antigen may have 
evolved the capacity to escape immune responses. 

EBNA1 is a phosphoprotein (Rawlins, D.R. el al. Cell 42, 
859-868, 1985) composed of unique N- and C-terminal domains 
(amino acids 1-89 and 327-641, respectively, in the 

30 prototype B95.8 EBV strain) joined by a repetitive sequence 
of arg-gly containing motifs surrounding an internal gly- 
ala repeat (Baer, R. , et al. Nature 310, 207-211, 1984) 
(Figure 1) . 

It has been observed that Gly-ala repeats of different 
35 lengths are present in all EBV isolates, and represent the 
major target of EBNA specific antibody responses. However, 
the function of the gly-containing sequence was previously 
unknown. The experiments described below were designed to 
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test whether the presence of the repetitive sequence which 
encompasses more than one third of the ENBAl molecule, 
influences the recognition of EBNA1 by the cellular immune 
system. 

5 

EXAMPLE V 

INSERTION OF FOREIGN EPITOPE S INTO EBNA1 
Taking advantage of the observation that EBV induced 
CTL responses in Caucasian HLA All positive individuals are 
10 often dominated by All-restricted reactivities to peptide 
epitopes corresponding to residues 399-408 and 416-424 of 
the EBNA4 protein (Gavioli, R. , et al.J. Virol. 67, 1572- 
1578 f 1993, de Campos-Limas , P.O., et al.J. Exp Med. 179, 
1297-1305, 1994), the immunodominant EBNA4 416-424 epitope 
15 was inserted in frame within the intact EBNA1 sequence, or 
within EBNA1 deletion mutants that did not contain the 
glycine-alanine repeats (Figure 2). The generation of 
vaccinia virus recombinants carrying the coding sequences of 
the EBNA1 and EBNA4 (also known as EBNA-3B) genes from the 
20 B95-8 virus has been described previously (Murray, R. , et al 
J* Exp Med. 176, 157-168, 1992). Chimeric proteins 
containing the HLA All restricted EBNA4 416-424 epitope 
(IVTDFSVIK, E4) within EBNA1 (El), or an EBNA1 deletion 
mutant that lacks the internal glycine-alanine repeat region 
25 ( E1AGA) were produced by inserting an oligonucleotide 
corresponding to the E4 epitope into various positions 
within the EBNA1 coding sequence. 

The pBS-ElAGA & the pBS-ElAGA plasmids, encoding the full 
length open reading frame of the EBNA1 gene and the deletion 
30 mutant, respectively were opened at the Ncoll site (genomic 
position: 108067), or the Ncol site, the PflMI site (gp: 
109291), or Bsu361 site (gp: 109510) . For insertion into the 
Ncol site, two oligonucleotides (E1N-E4F: 5 f - 
CATGCCATAGTAACTGACTTTAGTGTAATCAA G-3 ' and E1N-E4R: 5'- 
35 C ATG CTTG ATTACACTT AAGTCAGTTACT AT - 3 ' were annealed in 50 mM 
NaCl, 10 mM MgCl 2 , 50 mM tris-HCl, pH 8.0 at a 
concentration of 10 mM in a volume of 20 jxl by heating to 
97 °C for 5 minutes followed by slow cooling to 50 °C and 
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incubation at 37 °c for an additional 5 minutes. After 
annealing, dGTP was added to a concentration of 100 mM along 
with 3U of T4 DNA polymerase and incubated at 12 °C for 20 
minutes. EDTA was then added to 10 mM, followed by heating 
5 at 75 °C for 10 minutes and dilution to a final volume of 
100 Ml- The oligo was then ligated to Ncol digested pBS-El 
and pBS-ElAGA at a ratio of 100:1, For insertion into the 
PflMI site, oligos E1P-E4F ( 5 ' - 
CGATCGTAACTGACTTTAGTGTAATCAGG-3 ■ ) and E1P-E4R ( 5 ■ - 
10 CCTTGATTACACTAAAGTCAGTTACGATCGTGC-3 1 ) were annealed and 
ligated to the PflMI site of pBS-ElAGA as above. E1P-E4F 
and E1P-E4R were designed with a unique Pvul site for 
identification purposes and can only insert in the proper 
orientation. For insertion into the Bsu3 6l site, oligos 
15 E1B-E4F ( 5 1 -TAACGATCGTAACTGACTTTAGTGTAATCAAGG-3 1 ) and E1B- 
E4R ( 5 1 TTACCTTGATTACACTAAGTCAGTTACGATCG-3 ■ ) were annealed 
and ligated into the Bsu36I site of pBS-ElDGA as described 
above. After electroporation, plasmids containing inserts 
were screened by polymerase chain reaction of minipreps and 
20 sequenced to determine correct orientation and ensure 
complete fidelity throughout manipulations. 

Following isolation of oligo insertion sequences, the El 
and E1AGA open reading frames were excised by BstYI and 
Hindi digestion. The ends of the isolated DNA fragments 
25 were repaired with T4 DNA polymerase and inserted at the 
Smal site of pSCll. Due to the toxicity of the EBNA1 
glycine-alanine repeats for vaccinia virus replication, only 
recombinants containing the pBS-El chimera inserted in 
negative orientation with respect to the P7 . 5 promoter were 
3 0 selected. For pBS-ElAGA chimeras, both positive and 
negative orientation inserts were selected. 

Recombinant vaccinia viruses were generated by transf ection 
of the above-described plasmids into WR-strain-wild-type- 
vaccinia infected TK-143 cells and viral stocks were 
3 5 prepared and titrated in CV-1 cells, as previously described 
(Murray, R. , et al. J. Exp. Med. 176, 157-168, 1992). 

The capacity of recombinant vaccinia viruses to sensitize 
HLA All positive targets to lysis by EBNA4 416-424 epitope 
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specific CTLs was tested according to standard methods. 
Semi-confluent monolayers of fibroblasts form donor QJZ (HLA 
All B13,B51) were grown in 96 wells microliter plates 
(Costar) , infection was carried out for 12 hr at a m.o.i. of 
5 10 in the assay wells in the presence of 3m/ Ci 51 NaCr0 4 
(Amersham) per well. Alternatively, infection was carried 
out for different times starting from 24 hr before addition 
of the effectors. HLA -A 11 restricted EBNA4-specif ic CTL 
clones were obtained by stimulation of lymphocytes from the 
10 EBV seropositive donors BK (HLA A2,A11 B7,B35) with the 
autologous B95.8 virus transformed lymphoblastoid cell line 
(LCL) . Single cell cloning was done by limiting dilution. 

The specificity of the CTL clones for the EBNA4 416-424 
15 epitope was assessed by their capacity to lyse HLA All 
positive PHA blasts preincubated with 10* 1 °M of the 
corresponding synthetic peptide. The cytoxic activity was 
assayed in triplicates in standard 4 hr 51 Cr release assays. 

20 EBNA1 deletion mutants containing the EBNA4 416-4 24 epitope 
inserted at the his39 f pro446, or lys520 positions, relative 
to the B95.8 sequence ( Vacc-E1AGAN-E4 , Vacc-E1AGAP-E4 and 
Vacc-E1AGAB-E4 , respectively) sensitized HLA All positive 
fibroblasts to lysis by EBV-specif ic CTLs (Figure 3A) . The 

25 level of killing was in each case comparable to that 
observed after infection with a vaccinia recombinant 
expressing EBNA4 . In contrast, fibroblasts expressing a 
chimeric full size EBNA1 with the EBNA4 416-424 epitope 
inserted at his3 9 (Vacc-E1N-E4 ) were not recognized by the 

30 CTLs (Fig. 3B) . The differing recognition of Vacc-EIAGAN- 
E4 and Vacc-E1N-E4 infected cells supports the view that 
presence of the gly-ala repeats is directly involved in the 
failure of the E1N-E4 chimera to sensitize target cells to 
lysis, 

35 

EXAMPLE VI 

THE EFFECTS OF OVER-EXPRESSION OF EBNA1 
OK RECOGNITION OF A FOREIGN PROTEIN 
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The following experiments were performed in order to test 
whether over-expression of EBNA1 could influence antigen 
processing/presentation of other antigenic proteins 
expressed in the same target cell. High levels of EBNA1 
5 were expressed in vaccinia virus infected cells through the 
use of an inducible T7 RNA polymerase system (Murray, R., et 
al.J. Exp. Med. 176, 157-168, 1992). HLA All positive 
fibroblasts infected for 12 hr with Vacc-T7 and Vacc-EBNAl 
or, as controls, Vacc-T7 and Vacc-EBNA3 (Murray, R. , et 
10 al.J. Exp. Med. 176, 157-168, 1992) were superinf ected for 
6 hr with Vacc-EBNA4 and then tested for sensitivity to 
lysis by CTLs specific for the 399-408 or 416-424 epitopes 
(Figure 4). Over-expression of EBNA1 or EBNA3 did not 
prevent recognition of the Vacc-EBNA4 infected fibroblasts 
15 although a 30 to 50 % reduction of lysis was observed 
compared to the Vacc-EBNA4 control, probably due to 
competition of the viruses for the cellular 
transcription/ translation machinery. Thus, the gly-ala 
repeat structure does not function as a trans-acting 
20 inhibitor of antigen processing/presentation. 

It is concluded that presence of the Gly-containing 
sequence within the context of the target protein generates 
a signal which either prevents processing MHC class I 
antigen or, alternatively, sequesters the processing 
25 products to a cellular compartment which is inaccessible to 
MHC class I restricted presentation. Thus, failure to 
undergo antigen processing and MHC class I restricted 
presentation affords evasion of the immune system 
independently of the class I type of the host. 

30 

EXAMPLE VII 

INSERTION OF EBNA1 GLYCINE-CONTAINING SEQUENCE 
INTO A FOREIGN ANTIGENIC PROTEIN 

35 The following experiments were performed in order to 
demonstrate that insertion of the EBNA1 gly-ala repeats into 
an antigenic protein prevents recognition by the immune 
system of cells expressing the recombinant protein. A 
recombinant vaccinia virus expressing a chimeric EBNA4 
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protein that contains the gly-ala sequence (Figure 2) was 
produced. The pBS-E4IR plasmid was constructed by inserting 
an EBNA1 fragment extending from the Ncol site (genomic 
position(gp) : 108067) to the Apal site (gp: 109261) 
5 encompassing the internal gly-ala repeat region with 
flanking residues at the amino and carboxyl sides (IR, amino 
acid 39 to 437 relative to the B95.8 EBNA1 sequence) into 
the Msel site of EBNA4 (gp: 97302) between amino acid 
residues trp624 and pro625. A clone of correct orientation 
10 was sequenced to ensure proper reading frame alignment. The 
EBNA4IR open reading frame (E4IR) was excised by EcoRI and 
SstI, the ends were repaired, and ligated into the Smal site 
of pSCll. A clone of negative orientation relative to the 
P7.5 promoter was selected. Recombinant vaccinia viruses 
15 were generated by transfection of the above plasmids into 
WR-strain-wild-type-vaccinia infected TK-143 cells and viral 
stocks were prepared and titrated in CV-1 cells as 
previously described (Murray, R. , et al. J. Exp. Med. 176, 
157-168 1992) . HLA All positive cells infected with the 
20 Vacc-EBNA4 and Vacc-E4IR recombinants were assayed for 
expression of the recombinant proteins and for sensitivity 
to lysis by CTLs specific for the EBNA4 399-408 and 416-424 
epitopes. Judging from the percentage positive cells and 
intensity of fluorescence detected in experiments performed 
25 over a wide range of m.o.i., the recombinant proteins were 
expressed with identical kinetics and at similar levels 
(Levitskaya et al., Nature 375:685-688, 1995). HLA All 
positive fibroblasts infected with Vacc-EBNA4 were 
sensitized to lysis by CTL clones specific for the 399-408 
30 or 416-424 epitopes with maximal lysis after for 6 hrs (Fig. 
5A) at a m.o.i. of 5 (Fig. 5C) . In contrast, cells infected 
with Vacc-E4IR were lysed only weakly, even after infection 
at 6 fold higher m.o.i. or for prolonged periods of time. 
Infection of the QJZsp LCL that carries a Chinese EBV 
35 isolate with selective mutations aborting the recognition of 
the endogenous EBNA4 399-408 and 416-424 epitopes gave 
essentially similar results (Figure 5B and 5D) . It is 
concluded that the presence of the gly-containing sequence 
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within the antigenic protein of interest inhibits processing 
and MCH class I restricted presentation independently of the 
cell lineage of the target cells. The results demonstrate 
that the gly-containing sequence functions as a cis-acting 
5 inhibitor of antigen processing-presentation. 

USE 

The invention provides for avoiding an undesired immune 
response directed to a recombinant protein, e.g., i. 

10 antigenic proteins to be expressed in cells of human or 
other species for gene therapy purposes; ii. antigenic 
structural or regulatory proteins of transfer vectors used 
for the above purposes; and iii. components of vaccines 
which are not intended to be antigenic. 

15 Insertion of a nucleic acid sequence coding for the 
glycine-containing polypeptide, or functional derivatives 
thereof, can be used for modifying genes encoding 
therapeutic proteins, marker genes, regulatory proteins of 
viral vectors, or vaccine components. Therapeutic proteins 

2 0 to be expressed in mammalian cells include but are not 

limited to, e.g., enzymes, cytokines, lymphokines, cell 
adhesion molecules, costimulatory molecules or protein 
products of drug resistance genes or tumour suppressor 
genes. Marker genes include but are not limited to, e.g., 
25 neoR, multidrug resistance gene, thymidine kinase gene, B- 
galactosidase, dihydrof olate reductase (DHFR) and 
chloramphenicol acetyl transferase gene that can be inserted 
into cells together with a therapeutic gene or separately. 
For example, tumour-specific CTLs may contain a marker gene 

3 0 and thus be tagged with the encoded marker protein. This 

embodiment of the invention is useful in those cases in 
which a marker protein is immunogenic and thus may severely 
diminish the therapeutic efficacy of the CTL. Once injected 
into a patient, the progress of the tagged CTL to the tumour 
35 site may be followed via the marker, and the marker itself 
is rendered non-immunogenic according to the invention. The 
marker gene and the therapeutic gene also may be one and the 
same. 
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Structural or regulatory proteins of viruses are often 
immunogenic in patients. Such proteins are often encoded by 
viral vectors that are used or developed for purpose of gene 
therapy; for example, vectors based on vaccinia virus, 
5 adenovirus, herpesvirus or retroviruses. Components of 
vaccines useful as recombinant proteins of the invention 
include those vaccine components which are necessary for 
effective vaccination, but are desired as non-antigenic 
vaccine components, for example, vaccine structural 
10 components such as envelop, capsid, or regulatory proteins 
of viral, bacterial or parasitic infections, or components 
of cancer vaccines. By rendering certain components of the 
vaccine non-immunogenic, it is believed that the host immune 
response will be more effective against the immunogenic 
15 components of the vaccine. 

DOSAGE AND PHARMACEUTICAL FORMULATIONS 
A patient that is a afflicted with viral or genetic 
disease may be treated in accordance with the invention via 
jn vivo or ex vivo methods. For example in vivo treatments, 
20 a recombinant nucleic acid or protein of the invention can 
be administered to the patient, preferably in a biologically 
compatible solution or a pharmaceutical^ acceptable 
delivery vehicle, by ingestion, injection, inhalation or any 
number of other methods. The dosages administered will vary 
25 from patient to patient; a "therapeutically effective dose" 
is determined by the level of enhancement of function of the 
transferred genetic material balanced against any risk or 
deleterious side effects. Monitoring levels of gene 
introduction, gene expression and/ or the presence or levels 
30 of the encoded recombinant protein will assist in selecting 
and adjusting the dosages administered. Generally, a 
composition including a recombinant protein, as described 
herein, is administered in a single dose in the range of 10 
ng - 100 Mg/kg body weight, preferably in the range of 100 
35 ng - 10 Mg/kg body weight. 

Other embodiments will be evident to those of skill in the 
art. It should be understood that the above examples are 
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provided for exemplification only. The scope of the present 
invention is not limited to the above examples, but defined 
in the following claims. 
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Table 3 Cloning of gly-ala repeats into the Sma I site of pGEX-2T 

digestion of pGEX-2T with Sma I: 



BamHI Sma T EcoRI 
GTT£SA1££CCGGGMH£ATCGT 
CAACCTAGGGG CCCTTAAGTAG CA 

GTTGGATCCCC GGGAATTCATCGT 
CAACCTAGGGG CCCTTAAGTAGCA 



11 . 



modification of the ends of the gly-ala polymer with Klenow: 

GGA GCT GGT N...N GCG GGT 

GA CCA N...N CGC CCA CCT C 

GGA GCT GGT N...N GCG GGT GGh G 
CCT CGA CCA N . . . N CGC CCA CCT C 

ill. insertion of gly-ala into the pGEX-2T Sma I site: 

PGEX-2T gly-ala repeat nrFY 

cUrrrlrrrr GGA GCT GGT N. . . N GCG GGT GGA G GGGAATTCkTCGT 

CAACCTAGGGG CCT CGA CCA N...N CGC CCA CCT C cSJJSSJSS 
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Table 4 Cloning of gly-ala repeats into the Rsa I site of the influenza 
A matrix protein: 

i. preparation of the pGEM-matrix plasmid: 
Rsa l digestion 

GAA ACG T AC GTT CTC 

CTT TGC A TG CAA GAG 

ii. isolation and preparation of the gly-ala repeat: 

pGEX-2T gly-ala repeat pGEX-2T 

CTTGGATCCCC GGA GCT GGT N. • .N GCG GGT GGA G GGGAATTCATCGT 

CAACCTAGGGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAAGTAGCA 

Bam HI 

GATC CCC GGA GCT GGT N...N GCG GGT GGA G GGGAATTCATCGT 
GGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAAGTAGCA 

partial filing with A and G nucleotides (Klenow) : 

GATC CCC GGA GCT GGT N . . . N GCG GGT GGA G GGGAATTCATCGT 
AG GGG CCT CGA CCA N . . . N CGC CCA CCT C CCCTTAAGTAGCA 



Eco RI 



GATC CCC GGA GCT GGT N . . - N GCG GGT GGA G GGG 

AG GGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAA 



mung bean nuclease 



TC CCC GGA GCT GGT N...N GCG GGT GGA G GGG 
AG GGG CCT CGA CCA N . . . N CGC CCA CCT C CCC 



iii. insertion of gly-ala encoding repeat into the matrix protein: 

matrix gly-ala repeat matrix 
GAA ACG T TC CCC GGA GCT GGT N . . . N GCG GGT GGA G GG G AC GTT CTC 
CTT TGC £ GGG CCT CGA CCA N. . .N CGC CCA CCT C CCC JG CAA GAG 
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Table 5 Cloning of gly-ala repeats into the Bam H I site of the 
influenza A matrix protein: 

i. modification of the Bam H I site in the matrix protein: 
Bam HI 

AAC GGG GAT CCA AAT AAC 

TTG CCC CTA G GT TTA TTG 

Klenow 

AAC GGG GAT C GAT CCA AAT AAC 

TTG CCC CTA G CTA GGT TTA TTG 

ii. isolation of the Gly-Ala cassette 

pGEX-2T gly-ala repeat pGEX-2T 

GTTGGATCCCC GGA GCT GGT N...N GCG GGT GGA G GGGAATTCATCGT 
CAACCTAGGGG CCT CGA CCA N . • . N CGC CCA CCT C CCCTTAAGTAGCA 

Bam HI/Eco R I digestion 

GATC CCC GGA GCT GGT N...N GCG GGT GGA G GGG 

GGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAA 

partial filing with A and G nucleotides (Klenow) : 

GATC CCC GGA GCT GGT N...N GCG GGT GGA G GGGAA 
AG GGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAA 

mung bean nuclease 

TC CCC GGA GCT GGT N. . .N GCG GGT GGA G GGG AA 

AG GGG CCT CGA CCA N . . . N CGC CCA CCT C CCC TT 

iii. Insertion of gly-ala coding repeat into the matrix protein: 

matrix gly-ala repeat natrix 

GGG GAT C TC CCC GGA GCT GGT N . . . N GCG GGT GGA £_££G AA GAT CCA 
CCC CTA G AG GGG CCT CGA CCA N. . .K CGC CCA CCT C CC? TT CTA GGT 
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Table 6 Cloning of gly-ala repeats into the Stu I site of the influenza 
A matrix protein: 

i. digestion of the pGEM Matrix plasmid with Stu I: 



TTG CAG G 
AAC GTC C 



CC TAT CAG 
GG ATA GTC 



11 . 



isolation of the Gly-Ala cassette 



pGEX-2T gly-ala repeat 

GTTGGATCCCC GGA GCT GGT N . . . N GCG GGT GGA G 
CAACCTAGGGG CCT CGA CCA N . . . N CGC CCA CCT C 



PGEX-2T 
GGGAATTCATCGT 
CCCTTAAGTAGCA 



Bam HI 

GATC CCC GGA GCT GGT N...N GCG GGT GGA G 
GGG CCT CGA CCA N-..N CGC CCA CCT C 



GGGAATTCATCGT 
CCCTTAAGTAGCA 



partial filling with A and G nucleotides (Klenow) 



GATC CCC GGA GCT GGT N...N GCG GGT GGA G 
AG GGG CCT CGA CCA N . . . N CGC CCA CCT C 



GGGAATTCATCGT 
CCCTTAAGTAGCA 



Eco RI 

CATC CCC GGA GCT GGT N . . . N GCG GGT GGA G GGG 

AG GGG CCT CGA CCA N- . . N CGC CCA CCT C CCCTTAA 



mung bean nuclease 

TC CCC GGA GCT GGT N...N GCG GGT GGA G 
AG GGG CCT CGA CCA N...N CGC CCA CCT C 



GGG 
CCC 



iii. insertion of gly-ala coding repeat into the matrix protein: 

matrix gly-ala repeat matrix 
TTG CAG G TC CCC GGA GCT GGT N . . . N GCG GGT GGA G GG G CC TAT CAG 
AAC GTC £ GGG CCT CGA CCA N . . . N CGC CCA CCT C CC£ ££ ATA GTC 
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Table 7 Cloning of gly-ala repeats downstream of the matrix protein: 

i. preparation of matrix DNA by PCR amplification 
product 

ATAAAGCTT ATG AGT N . . . N TTC AAG TGGATCCTCG 
TATTTCGAA TAC TCA N . . . N AAG TTC ACCTAGGAGC 

Hind III-Bam H I digestion of PCR amplified matrix DNA: 

AGCTT ATG AGT N...N TTC AAG TG 

A TAC TCA N. . .N AAG TTC ACCTAG 

ii. isolation of the Gly-Ala cassette 

pGEX-2T gly-ala repeat pGEX-2T 

GTTGGATCCCC GGA GCT GGT N . . . N GCG GGT GGA G GGGAATTCATCGT 
CAACCT AGGGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAAGTAGCA 

Eco RI digestion pGEX- (GlyAla) n 

GTTGGATCCCC GGA GCT GGT N N GCG GGT GGA G GGG 

CAACCTAGGGG CCT CGA CCA N...N CGC CCA CCT C CCCTTAA 

mung bean nuclease treatment of the Eco R I site 

GTTGGATCCCC GGA GCT GGT N...N GCG GGT GGA G GGG 
CAACCTAGGGG CCT CGA CCA N...N CGC CCA CCT C CCC 

Bam K I digestion of pGEX- (GlyAla) n 

GATC CCC GGA GCT GGT N...N GCG GGT GGA G GGG 
GGG CCT CGA CCA N . . . N CGC CCA CCT C CCC 

iii. preparation of pGEM-92f: 

digestion of pGEM-9Zf with Xba I 

N . . . TAGTAAG CTTTGCT CTAGACTGGA . . . N 
N . . . ATCATTCGAAACGAGATC TGACCT . • . N 

mung bean treatment of the Xba I site 

N, . . TAGTAAGCTTTGCT ACTGGA . . .N 

N . . . ATCATTCGAAACGA TGACCT . . . N 

Hind III digestion of pGEM-9Zf 

N • . . TAGTA ACTGGAATTCGTCGACGAGCTCCCTATAGTN . . . N 

N . . . ATCATTCGA TG ACCTTAAGCAGCTGCTCGAGGGATATCAN . . . N 

iv. tandem insertion of matrix and gly-ala into pGEM-92f: 
Hind Ill/Bam H I/blunt ligation. 

pGEM-92f matrix PCR gly-ala 

N.TA AGCTT ATG AGT N. . .N TTC AAG TG G ATC CCC GGA GCT GGT N . . . N 

N.ATTCGA A TAC TCA N . . . N AAG TTC ACC TAG GGG CCT CGA CCA N. . .N 

gly-ala pGEM-92f STOP 

M. . .N GCG GGT GGA G GG G AC TGG AAT TCG TCG ACG AGC TCC CTA TAG N N 
N . . . N CGC CCA CCT C CC C TG ACC TTA AGC AGC TGC TCG AGG GAT ATC N " * N 
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CLAIMS 

1. A method of rendering an antigenic protein invisible 
to the immune system by the insertion in said protein of a 
glycine-rich repeat sequence. 

2. The method of claim 1 wherein the glycine-rich repeat 
sequence is any amino acid sequence containing "n M 
contiguous repeats of the generic repeating unit: 

[Gly a X Gly b Y Gly c Z] 

wherein a is l, 2, or 3; b is 1, 2, or 3; c is 1, 2, or 3; 
each of X, Y and Z is, independently, a hydrophobic or polar 
amino acid without a ring structure and having a side-chain 
of less than 3 atoms; and wherein "n M is an integer between 
1 and 66 inclusive. 

3. The method of claim 1 or claim 2 wherein the glycine- 
rich repeat sequence is the EBNA1 glycine-rich sequence 
shown in Figure 1. 

4. A protein formed by the method of any one of claims 
1 to 3. 

5. A nucleic acid encoding the protein of claim 4. 

6. A nucleic acid encoding the glycine-rich repeat 
sequence of the protein of claim 4. 

7. The nucleic acid of claim 6 which encodes amino acids 
89-327 of Epstein Barr virus (strain B95.8) nuclear antigen 
1 (EBNAl) shown in Figure 1. 

8. A vector comprising the nucleic acid sequence of any 
one of claims 5 to 7 . 
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9. A host cell transformed with the vector of claim 8. 

10. A method for testing for glycine-rich repeat sequences 
which confer on an antigenic protein containing such a 
sequence the ability to evade the immune system, comprising 
incubating a host cell that expresses the protein of claim 
4 with a cell of the immune system that is specific for an 
epitope that is common to said protein of claim 4 and 
reference protein, under conditions which allow for 
immunological recognition of said epitope, wherein said 
epitope is known or determined to be immunogenic in said 
reference protein, and wherein an absence of immunological 
recognition is indicative of evasion of the immune system. 

11. A pharmaceutical composition comprising the protein 
of claim 4 or the nucleic acid of any one of claims 5 to 7 
in combination with a pharmaceutical^ acceptable carrier or 
diluent. 

12. The protein of claim 4 or the nucleic acid of any one 
of claims 5 to 7 for use in therapy. 

13. The use of the protein of claim 4 or the nucleic acid 
of any one of claims 5 to 7 in the manufacture of a 
composition for the treatment of a disease by gene therapy. 
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FIG. 3(A) 

100-1 
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FIG. 4 
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FIG. 5(A) 
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